AITopics

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Giulia Denevi, Carlo Ciliberto, Dimitris Stamos, Massimiliano Pontil

Learning To Learn Around A Common Mean

Neural Information Processing SystemsNov-20-2025, 19:37:09 GMT

Neural Information Processing Systems http://nips.cc/

algorithm, artificial intelligence, machine learning, (14 more...)

Country:

North America > United States (0.28)
Europe > United Kingdom > England > Greater London > London (0.04)
North America > Canada > Quebec > Montreal (0.04)
Europe > Italy > Liguria > Genoa (0.04)

Industry: Education > Educational Setting (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Transfer Learning (0.65)

Neural Information Processing SystemsOct-2-2025, 00:21:59 GMT

0a716fe8c7745e51a3185fc8be6ca23a-Paper.pdf

algorithm, artificial intelligence, machine learning, (13 more...)

Country: Europe (0.28)

Genre: Research Report (0.46)

Industry: Education > Educational Setting (0.68)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.94)

Giulia Denevi, Dimitris Stamos, Carlo Ciliberto, Massimiliano Pontil

Online-Within-Online Meta-Learning

Neural Information Processing SystemsAug-20-2025, 06:18:01 GMT

Neural Information Processing Systems http://nips.cc/

algorithm, cumulative error, learning, (14 more...)

Country:

Europe > United Kingdom > England > Greater London > London (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
(2 more...)

Genre:

Instructional Material > Online (0.50)
Workflow (0.46)

Industry: Education > Educational Setting (0.47)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Neural Information Processing SystemsOct-11-2024, 02:57:44 GMT

Online-Within-Online Meta-Learning

We study the problem of learning a series of tasks in a fully online Meta-Learning setting. The goal is to exploit similarities among the tasks to incrementally adapt an inner online algorithm in order to incur a low averaged cumulative error over the tasks. We focus on a family of inner algorithms based on a parametrized variant of online Mirror Descent. The inner algorithm is incrementally adapted by an online Mirror Descent meta-algorithm using the corresponding within-task minimum regularized empirical risk as the meta-loss. In order to keep the process fully online, we approximate the meta-subgradients by the online inner algorithm.

inner algorithm, online mirror descent, online-within-online meta-learning, (1 more...)

Genre: Instructional Material > Online (0.65)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Osadchiy, Ilya, Levy, Kfir Y., Meir, Ron

Online Meta-Learning in Adversarial Multi-Armed Bandits

arXiv.org Machine LearningJul-12-2022

We study meta-learning for adversarial multi-armed bandits. We consider the online-within-online setup, in which a player (learner) encounters a sequence of multi-armed bandit episodes. The player's performance is measured as regret against the best arm in each episode, according to the losses generated by an adversary. The difficulty of the problem depends on the empirical distribution of the per-episode best arm chosen by the adversary. We present an algorithm that can leverage the non-uniformity in this empirical distribution, and derive problem-dependent regret bounds. This solution comprises an inner learner that plays each episode separately, and an outer learner that updates the hyper-parameters of the inner algorithm between the episodes. In the case where the best arm distribution is far from uniform, it improves upon the best bound that can be achieved by any online algorithm executed on each episode individually without meta-learning.

artificial intelligence, data mining, machine learning, (19 more...)

2205.15921

Country:

Asia > Middle East > Israel > Haifa District > Haifa (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre:

Instructional Material > Online (0.40)
Research Report (0.40)

Technology:

Information Technology > Data Science > Data Mining > Big Data (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Luo, Yiyun, Sun, Will Wei, Liu, and Yufeng

Distribution-free Contextual Dynamic Pricing

arXiv.org Machine LearningSep-15-2021

Contextual dynamic pricing aims to set personalized prices based on sequential interactions with customers. At each time period, a customer who is interested in purchasing a product comes to the platform. The customer's valuation for the product is a linear function of contexts, including product and customer features, plus some random market noise. The seller does not observe the customer's true valuation, but instead needs to learn the valuation by leveraging contextual information and historical binary purchase feedbacks. Existing models typically assume full or partial knowledge of the random noise distribution. In this paper, we consider contextual dynamic pricing with unknown random noise in the valuation model. Our distribution-free pricing policy learns both the contextual function and the market noise simultaneously. A key ingredient of our method is a novel perturbed linear bandit framework, where a modified linear upper confidence bound algorithm is proposed to balance the exploration of market noise and the exploitation of the current knowledge for better pricing. We establish the regret upper bound and a matching lower bound of our policy in the perturbed linear bandit framework and prove a sub-linear regret bound in the considered pricing problem. Finally, we demonstrate the superior performance of our policy on simulations and a real-life auto-loan dataset.

linear bandit, pricing, pricing problem, (15 more...)

2109.0734

Country:

North America > United States > California (0.04)
North America > United States > North Carolina (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
(3 more...)

Genre: Research Report (0.86)

Industry:

Banking & Finance (0.67)
Health & Medicine > Therapeutic Area (0.34)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Data Science (0.94)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.92)

Denevi, Giulia, Pontil, Massimiliano, Ciliberto, Carlo

The Advantage of Conditional Meta-Learning for Biased Regularization and Fine-Tuning

arXiv.org Machine LearningAug-25-2020

Biased regularization and fine-tuning are two recent meta-learning approaches. They have been shown to be effective to tackle distributions of tasks, in which the tasks' target vectors are all close to a common meta-parameter vector. However, these methods may perform poorly on heterogeneous environments of tasks, where the complexity of the tasks' distribution cannot be captured by a single meta-parameter vector. We address this limitation by conditional meta-learning, inferring a conditioning function mapping task's side information into a meta-parameter vector that is appropriate for that task at hand. We characterize properties of the environment under which the conditional approach brings a substantial advantage over standard meta-learning and we highlight examples of environments, such as those with multiple clusters, satisfying these properties. We then propose a convex meta-algorithm providing a comparable advantage also in practice. Numerical experiments confirm our theoretical findings.

algorithm, artificial intelligence, machine learning, (16 more...)

2008.10857

Country:

Europe > United Kingdom > England > Greater London > London (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Europe > Italy (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report (1.00)

Industry: Education > Educational Setting (0.68)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)

Denevi, Giulia, Stamos, Dimitris, Ciliberto, Carlo, Pontil, Massimiliano

Online-Within-Online Meta-Learning

Neural Information Processing SystemsMar-19-2020, 02:02:08 GMT

We study the problem of learning a series of tasks in a fully online Meta-Learning setting. The goal is to exploit similarities among the tasks to incrementally adapt an inner online algorithm in order to incur a low averaged cumulative error over the tasks. We focus on a family of inner algorithms based on a parametrized variant of online Mirror Descent. The inner algorithm is incrementally adapted by an online Mirror Descent meta-algorithm using the corresponding within-task minimum regularized empirical risk as the meta-loss. In order to keep the process fully online, we approximate the meta-subgradients by the online inner algorithm.

inner algorithm, online mirror descent, online-within-online meta-learning, (1 more...)

Genre: Instructional Material > Online (0.65)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Wang, Ruohan, Demiris, Yiannis, Ciliberto, Carlo

A Structured Prediction Approach for Conditional Meta-Learning

arXiv.org Machine LearningFeb-20-2020

Optimization-based meta-learning algorithms are a powerful class of methods for learning-to-learn applications such as few-shot learning. They tackle the limited availability of training data by leveraging the experience gained from previously observed tasks. However, when the complexity of the tasks distribution cannot be captured by a single set of shared meta-parameters, existing methods may fail to fully adapt to a target task. We address this issue with a novel perspective on conditional meta-learning based on structured prediction. We propose task-adaptive structured meta-learning (TASML), a principled estimator that weighs meta-training data conditioned on the target task to design tailored meta-learning objectives. In addition, we introduce algorithmic improvements to tackle key computational limitations of existing methods. Experimentally, we show that TASML outperforms state-of-the-art methods on benchmark datasets both in terms of accuracy and efficiency. An ablation study quantifies the individual contribution of model components and suggests useful practices for meta-learning.

algorithm, meta-learning, tasml, (12 more...)